Research on the Deep Deterministic Policy Algorithm Based on the First-Order Inverted Pendulum
نویسندگان
چکیده
With the mature development of artificial intelligence technology, application intelligent control algorithms in systems has become a trend to meet high-performance requirements modern society. This paper proposes deep deterministic policy gradient (DDPG) controller design method based on reinforcement learning improve system performance. Firstly, optimal DDPG algorithm is derived from Markov decision process and Actor–Critic algorithm. Secondly, order avoid local optima traditional systems, capacity settlement experience pool are adjusted absorb positive accelerate convergence complete efficient training. In response, solve overestimation Q value DDPG, overall structure Critic network changed shorten period at low rates. Finally, first-order inverted pendulum was constructed simulation environment verify effectiveness PID, improved DDPG. The results reveal that faster response disturbances, smaller displacement, angular displacement pendulum. further proves better stability stronger anti-interference ability recovery. provides certain reference for systems.
منابع مشابه
survey on the rule of the due & hindering relying on the sheikh ansaris ideas
قاعده مقتضی و مانع در متون فقهی کم و بیش مستند احکام قرار گرفته و مورد مناقشه فقهاء و اصولیین می باشد و مشهور معتقند مقتضی و مانع، قاعده نیست بلکه یکی از مسائل ذیل استصحاب است لذا نگارنده بر آن شد تا پیرامون این قاعده پژوهش جامعی انجام دهد. به عقیده ما مقتضی دارای حیثیت مستقلی است و هر گاه می گوییم مقتضی احراز شد یعنی با ماهیت مستقل خودش محرز گشته و قطعا اقتضاء خود را خواهد داشت مانند نکاح که ...
15 صفحه اولthe role of task-based techniques on the acquisition of english language structures by the intermediate efl students
this study examines the effetivenss of task-based activities in helping students learn english language structures for a better communication. initially, a michigan test was administered to the two groups of 52 students majoring in english at the allameh ghotb -e- ravandi university to ensure their homogeneity. the students scores on the grammar part of this test were also regarded as their pre...
15 صفحه اولthe u.s. policy in central asia and its impact on the colored revolutions in the region (the case study of tulip revolution in kyrgyzstan)
چکیده ندارد.
15 صفحه اولon the effects of pictorial clues on the efl learners listening comprehension development
the following null hypothesis was proposed: there is no significant difference between the efl students listening comprehension development receiving pictorial cues and those receiving no cuse. to test the null hypothesis, 52 male and femal freshmen students of medicine studing at iran university of medical scinces were randomly selected from a total population of 72 students. to ensure that th...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13137594